Mergent C Omplexity via M Ulti - a Gent

نویسندگان

  • Trapit Bansal
  • Jakub Pachocki
  • Szymon Sidor
چکیده

Reinforcement learning algorithms can train agents that solve problems in complex, interesting environments. Normally, the complexity of the trained agent is closely related to the complexity of the environment. This suggests that a highly capable agent requires a complex environment for training. In this paper, we point out that a competitive multi-agent environment trained with self-play can produce behaviors that are far more complex than the environment itself. We also point out that such environments come with a natural curriculum, because for any skill level, an environment full of agents of this level will have the right level of difficulty. This work introduces several competitive multi-agent environments where agents compete in a 3D world with simulated physics. The trained agents learn a wide variety of complex and interesting skills, even though the environment themselves are relatively simple. The skills include behaviors such as running, blocking, ducking, tackling, fooling opponents, kicking, and defending using both arms and legs. A highlight of the learned behaviors can be found here: https://goo.gl/eR7fbX.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mergent T Ranslation in M Ulti - a Gent C Ommunication

While most machine translation systems to date are trained on large parallel corpora, humans learn language in a different way: by being grounded in an environment and interacting with other humans. In this work, we propose a communication game where two agents, native speakers of their own respective languages, jointly learn to solve a visual referential task. We find that the ability to under...

متن کامل

O Rganizational M Ulti - a Gent S Ystems : a P Rocess D Riven a Pproach

There exists a need in industrial and business applications to intelligently integrate data, information and knowledge from a diverse range of sources, particularly during product design, or policy formation. Optimizing decision making requires the expertise of many agents, both computational and human, to be combined and coordinated. The concept of an ’organization’ has emerged as central to t...

متن کامل

Nformation S Ecurity a Pproach in O Pen D Istributed M Ulti - a Gent V

This paper presented the main information, security problems and threats in open multi-agent distributed e-learning information systems and Proposed various approaches to solve information security attacks in virtual learning environment using service oriented architecture which based on multi-agent information systems architecture, the solution on the multi-agent learning information system im...

متن کامل

Cyclin-dependent kinases and cell division in plants- the nexus

Vladimir Mironov, a,b Lieven De Veylder, a Marc Van Montagu, a and Dirk Inzé a,b,c,1 a Laboratorium voor Genetica, Departement Plantenggenetica, Vlaams Interuniversitair Instituut voor Biotechnologie, Universiteit Gent, K.L. Ledeganckstraat 35, B-9000 Gent, Belgium b CropDesign N.V., Technologiepark 3, B-9052 Zwijnaarde, Belgium c Laboratoire Associé de l’Institut National de la Recherche Agron...

متن کامل

S Tudents ’ P Erformance P Rediction S Ystem Using M Ulti a Gent Data M Ining T Echnique

A high prediction accuracy of the students’ performance is more helpful to identify the low performance students at the beginning of the learning process. Data mining is used to attain this objective. Data mining techniques are used to discover models or patterns of data, and it is much helpful in the decision-making. Boosting technique is the most popular techniques for constructing ensembles ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018